Non-natural consensus albumin binding domains

ABSTRACT

Non-natural albumin binding domains, polynucleotides encoding thereof and methods of making and using these domains and polynucleotides are useful in controlling the half-life of therapeutic molecules for patients.

CROSS-REFERENCE TO RELATED APPLICATIONS

This application claims the benefit of U.S. Provisional Application Ser. Nos. 61/651,642, filed 25 May 2012 and 61/776,918, filed 12 Mar. 2013, the entire contents of which are incorporated herein by reference in their entireties.

FIELD OF THE INVENTION

The present invention relates to albumin binding domains and methods of making and using them. More particularly, the present invention is directed to a non-natural albumin binding domain consensus sequence and variants thereof as described herein.

BACKGROUND OF THE INVENTION

Rapid elimination of biotherapeutic molecules via renal clearance contributes to limited clinical effectiveness or more frequent dosing for the patient. Renal clearance due to glomerular filtration is most associated with smaller biotherapeutics, as the rates of kidney filtration are greatly reduced for molecules with a molecular weight of greater 50,000 daltons (Kontermann, Curr Opin Biotechnol 22:868-76, 2011). Several approved biotherapeutic drugs contain active portions that on their own fall below the filtration limit and are thus cleared quickly. To overcome this limitation, a number of technologies have been introduced to effectively increase the size of the therapeutic molecule to reduce kidney filtration and resulting half-life.

PEGylation (PEG) of therapeutics is an effective way to increase the hydrodynamic radius of the protein and reduce glomerular filtration. One or several PEG chains can be coupled to the protein most commonly through conjugation to free thiol or amine groups on the protein surface. PEGylated versions of Adenosine deaminase, L-Asparaginase, Interferon alpha-2b, G-CSF, Human Growth Hormone, Erythropoietin, Uricase, and an anti-TNFalpha antibody fragment have all been approved for human therapy (Kontermann, Curr Opin Biotechnol 22:868-76, 2011). Limitations of PEGylation include production of heterogeneous products and difficulty in controlling the number of PEG molecules attached to certain proteins. PEGylation introduces additional conjugation as well as purification steps to the production of therapeutic proteins, resulting in decreased yields and increased costs of goods. PEGylation may also lead to renal tubular vacuolization in animals and patients as PEG chains are non-degradable in the kidneys (Gaberc-Porekar et al., Curr Opin Drug Discov Devel 11:242-250, 2008).

Coupling a therapeutic to an antibody Fc region to generate Fc-fusion proteins can be used to increase the serum half-life of therapeutic molecules. Immunoglobulins may exhibit long half-lives on the order of several weeks in humans due to their large size and recycling through FcRn (Kuo et al., J Clin Immunol 30:777-789, 2010). TNF receptor 2, LFA-3, CTLA-4, IL-1R, and TPO-mimetic peptide molecules are all approved therapies produced as Fc-fusions (Kontermann, Curr Opin Biotechnol 22:868-76, 2011). Fc-fusion proteins are not ideal for all therapeutic classes for several reasons. The homodimeric nature of the Fc region results in the production of a dimeric therapeutic protein, possibly leading to cellular activation due to receptor clustering. Fc-fusions must also be made in mammalian expression systems which may be more costly than prokaryotic systems.

In addition to Fc, albumin exhibits a long half-life in vivo due to FcRn recycling. At a concentration of approximately 40 g/L, Human Serum Albumin (HSA) is the most abundant protein found in the blood. FcRn recycling leads to a long half-life of approximately 19 days in humans. Additionally, biodistribution studies suggest that albumin may distribute within the body to areas important for targeting disease, such as inflamed joints or tumors (Wunder et al., J Immunol 170:4793-4801, 2003). Thus, the serum half-life of a number of proteins has been increased by producing them as either C-terminal or N-terminal fusions to HSA. Successful fusions include interferon alpha (Flisiak and Flisiak, Expert Opin Biol Ther 10:1509-1515, 2010), human growth hormone (Osborn et al., Eur J Pharmacol 456:149-158, 2002), tumor necrosis factor (Muller et al., Biochem Biophys Res Commun 396:793-799, 2010), coagulation factor IX (Metzner et al., Thromb Haemost 102: 634-644, 2009), coagulation factor VIIa (Schulte, Thromb Res 122 Suppl 4: S14-19, 2008), insulin (Duttaroy et al., Diabetes 54:251-258, 2005), urokinase (Breton et al., Eur J Biochem 231:563-569, 1995), hirudin (Sheffield et al., Blood Coagul Fibrinolysis 12:433-443, 2001), and bispecific antibody fragments (Muller et al, J Biol Chem 282:12650-12660, 2007). HSA fusion proteins may have long serum half-lives, however large scale production of these fusion proteins is limited predominantly to yeast expression systems. Additionally, the large size of HSA may lead to a loss in activity of the therapeutic due to steric hindrance.

Therapeutic proteins may also be produced as fusion proteins to peptides or proteins that bind to serum albumin in the blood stream to increase their half life. Such albumin binding peptides include cysteine-constrained peptides or antibody fragments to albumin. Expression of a Fab antibody fragment as a fusion to cysteine-constrained peptides significantly increased the serum half-life of the Fab (Dennis et al., J Biol Chem 277:35035-35043, 2002; US2004/0253247A1). Coupling cysteine-constrained peptide to an antibody fragment led to better peak tumor accumulation and more homogeneous tumor distribution compared to Fab and mAb molecules targeting the same antigen (Dennis et al., Cancer Res 67:254-261, 2007; US2005/0287153A1). Further, a number of antibody fragments that bind specifically to albumin have been coupled to therapeutic moieties to increase the half life of the therapeutic. A camelid V_(HH) antibody fragment (Nanobodies®) that binds to HSA was fused to another Nanobody® that binds to TNF-alpha (Coppieters et al., Arthritis Rheum 54: 1856-1866, 2006) or anti-EGFR Nanobodies® (Tijink et al., Mol Cancer Ther 7:2288-2297, 2008). Anti-albumin domain antibodies (dAbs) have been generated that bind to albumin, and have been fused to, for example, interleukin-1 receptor (Holt et al., Protein Eng Des Sel 21:283-288, 2008) and interferon alpha 2b (Walker et al., Protein Eng Des Sel 23:271-278, 2010) to improve their half life.

A number of naturally occurring protein domains from bacteria are known to interact with albumin, presumably to help such bacteria distribute throughout the host organism. These are 3-helix bundle protein domains approximately 6 kDa in size which use one face of the 3-helix bundle to interact with serum albumin (Cramer et al., FEBS Lett 581:3178-3182, 2007; Lejon et al., Acta Crystallogr Sect F Struct Biol Cryst Commun 64:64-69, 2008; Johansson et al., FEBS Lett 374:257-261, 1995; Johansson et al., J Mol Biol 266:859-865, 1997; Johansson et al., J Biol Chem 277:8114-8120, 2002). One such albumin binding domain derived from streptococcal protein G (Jonsson et al., Protein Eng Des Sel 21:515-527, 2008), has been most widely used to extend the serum half-life of proteins. Fusion to this domain has been shown to increase the half-life of soluble complement receptor type 1 (Makrides et al., J Pharmacol Exp Ther 277:534-542, 1996), a bispecific antibody (Stork et al., Protein Eng Des Sel 20:569-576, 2007), CD4 (Nygren et al., Vaccines 91:363-368, 1991; U.S. Pat. No. 6,267,964B1), Pf155/RESA (Stahl et al., J Immunol Methods 124:43-52, 1989), G-CSF (Frejd, F. PEGS Europe, Oct. 5, 2010), and affibody molecules binding to a number of targets (Andersen et al., J Biol Chem 286:5234-5241, 2011) (Frejd, F. PEGS Europe, Oct. 5, 2010). However, antibody production against the domain has been reported in patients and thus the use of the molecule for therapeutic applications may be challenging (Goetsch et al., Clin Diagn Lab Immunol 10:125-132, 2003; Libon et al., Vaccine 17:406-414, 1999).

A number of protein domains or peptides that bind to albumin are capable of extending the serum half-life and producing a more beneficial biodistribution pattern of therapeutic proteins. In order to use these albumin binding domains in therapeutic applications, a number of biophysical requirements need to be fulfilled, such as high expression levels in a host, solubility and stability, and minimal immunogenicity. The albumin binding moiety should bind to serum albumin with an affinity that effectively balances serum half-life and biodistribution with activity of the therapeutic moiety when bound and not bound to albumin.

SUMMARY OF THE INVENTION

One aspect of the invention is a protein comprising an isolated, non-natural albumin binding domain having the amino acid sequence of SEQ ID NO:21. Another aspect of the invention is an isolated non-natural albumin binding domain comprising an amino acid sequence at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 21.

Yet another aspect of the invention is an isolated non-natural albumin binding domain comprising an amino acid sequence of SEQ ID NO: 21 having substitutions at 1, 2, 3, 4, 5, and/or 6 residues, and, preferably, wherein the substitutions at 1, 2, 3, 4, 5, and/or 6 residues may occur at amino acid positions Y21, Y22, L25, K30, T31, E33, G34, A37, L38, E41, I42 and/or A45 of SEQ ID NO: 21 or at amino acid positions Y21, Y22, K30, T31, A37, and/or E41 of SEQ ID NO: 21.

A further aspect of the invention is an isolated non-natural albumin binding domain comprising an amino acid sequence: LKEAKEKAIEELKKAGITSDX₁X₂FDLINKAX₃X₄VEGVNX₅LKDX₆ILKA (SEQ ID NO: 22); wherein X₁, X₂, X₃, X₄, X₅, and X₆ can be any amino acid or a subset of certain amino acids.

In a further aspect of the invention, the isolated non-natural albumin binding domain comprises an extension of 5 amino acids at its N-terminus.

Another aspect of the invention is a method of making a non-natural albumin binding domain of the invention comprising providing a polynucleotide encoding the non-natural albumin binding domain; expressing the polynucleotide in a host or in vitro; and recovering the non-natural albumin binding domain. Another aspect of the invention is an isolated polynucleotide encoding the albumin binding domains of the invention. Another aspect of the invention is an isolated polynucleotide comprising a polynucleotide of SEQ ID NO: 35.

Another aspect of the invention is an isolated vector comprising the isolated polynucleotide of the invention and a host cell comprising the isolated vector of the invention.

Another aspect of the invention is a fusion protein comprising an albumin binding protein of the invention and a bioactive agent.

Another aspect of the invention is a pharmaceutical composition comprising the fusion protein of the invention and at least one pharmaceutically acceptable carrier or diluent.

BRIEF DESCRIPTION OF THE DRAWING

FIG. 1 shows SDS-PAGE analysis of purified ABDCon. Samples are as follows lane 1) SeeBlue plus 2 maker, 2) total cell lysate, 3) soluble cell lysate, 4) column flowthrough, 5-12) eluted fractions. Molecular weights of some of the marker bands are shown on the left.

FIG. 2 shows electro-spray ionization mass spectrometry of purified ABDCon sample.

FIG. 3 shows size exclusion chromatography analysis of purified ABDcon as run in PBS.

FIG. 4A shows the melting temperature and FIG. 4B shows the reversibility of ABDCon unfolding as measured by DSC in PBS. The normalized, baseline subtracted data for the first scan is shown in FIG. 4A. After the first scan, the sample was cooled to 20° C. and the scan repeated to determine the reversibility of folding. The raw data traces for the first and second scans are overlain in FIG. 4B.

FIG. 5 shows the pharmacokinetics of a Tencon25-ABDCon fusion protein in mice when dosed at 2 mg/kg intravenously.

FIG. 6 shows the pharmacokinetics of a Tencon25 (residues 1-90 of SEQ ID NO: 39) molecule fused to ABDCon (SEQ ID NO: 21), ABDCon3 (SEQ ID NO: 26), ABDcon5 (SEQ ID NO: 28), ABDCon7 (SEQ ID NO: 30) and ABDCon9 (SEQ ID NO: 32) in mice when dosed at 2 mg/kg intravenously.

FIG. 7A shows stability of several FN3 domainABDCon fusion proteins when incubated at 37° C. for 0 days (left panel) or 28 days (right panel) in PBS as assessed on SDS-PAGE. Left panel lines from left to right: Lane 1: molecular weight marker, Lane 2: Tencon-ABDCon, Lane 3: Tencon-ABDCon, Lane 4: Tencon25-ABDCon, Lane 5: 83-ABDCon, Lane 6: 71-ABDCon fusion proteins. Right panel lanes: Lane 1: molecular weight marker, Lane 2: Tencon-ABDCon4C, Lane 3: Tencon-ABDCon, Lane 4: Tencon25-ABDCon, Lane 5:83-ABDCon, Lane 6: 71-ABDCon fusion proteins. FIG. 7B shows stability of FN3 domain-ABDCon12 having an extended N-terminal helix after 0, 1, 2, 3 or 4 week incubation (as indicated above the lanes) at 37° C. in PBS.

DETAILED DESCRIPTION OF THE INVENTION

The term “albumin binding domain” or “domain” as used herein refers to a polypeptide that binds albumin in vivo or in vitro. Albumin may be derived from any animal species, for example human, monkey, or rodent.

The term “K_(D),” as used herein, refers to the dissociation constant between albumin and the albumin binding domain.

The term “K_(on),” as used herein, refers to the on rate constant for association of an albumin binding domain to albumin to form an albumin binding domain/albumin complex.

The term “K_(off),” as used herein, refers to the off rate constant for dissociation of an albumin binding domain from the albumin binding domain/albumin complex.

The term “non-natural” as used herein refers to a domain that is synthetic, i.e., having an amino acid sequence not present in native polypeptides.

The term “substituting” or “substitutions” as used herein refers to altering, deleting or inserting, or to alterations, deletions or insertions of one or more amino acids or nucleotides in a polypeptide or polynucleotide sequence to generate a variant of that sequence.

The term “variant” as used herein refers to a polypeptide or a polynucleotide that differs from a reference polypeptide or a reference polynucleotide by one or more modifications, for example, substitutions, insertions or deletions.

The term “bioactive agent” as used herein refers to proteins, antibodies, peptides, nucleotides, small molecular pharmaceuticals and the like, that, when administered to an animal patient provides a benefit to that patient. Synthetically produced, naturally derived or recombinantly produced moieties are included in this term. Bioactive agents may be analogs, derivatives, agonists, antagonists, enantiomers or pharmaceutically acceptable salts of bioactive agents.

The term “stability” as used herein refers to the ability of a molecule to maintain a folded state under physiological conditions such that it retains at least one of its normal functional activities, for example, half life.

The term “vector” means a polynucleotide capable of being duplicated within a biological system or that can be moved between such systems. Vector polynucleotides typically contain elements, such as origins of replication, polyadenylation signal or selection markers, that function to facilitate the duplication or maintenance of these polynucleotides in a biological system. Examples of such biological systems may include a cell, bacteria, virus, animal, plant, and reconstituted biological systems utilizing biological components capable of duplicating a vector. The polynucleotide comprising a vector may be DNA or RNA molecules or a hybrid of these.

The term “expression vector” means a vector that can be utilized in a biological system or in a reconstituted biological system to direct the translation of a polypeptide encoded by a polynucleotide sequence present in the expression vector.

The term “operably linked” as used herein refers to a positioning of components such that they function in their intended manner.

Amino acids are referred herein using their standard three or one letter codes:

Albumin Binding Domain Compositions

The present invention provides a synthetic albumin binding domain (ABDCon) (SEQ ID NO: 21) and variants thereof ABDCon can be operably linked to a bioactive agent for enhancement of serum half-life and biodistribution of the therapeutic agent. ABDCon and variants thereof can be expressed at high levels in E. coli, are soluble, and have high thermal stability. The present invention provides polynucleotides encoding ADBCon and variants thereof, complementary nucleic acids, vectors, host cells, and methods of making and using them.

The present invention further provides synthetic albumin binding domain (ABDCon) that has an extension of 5 amino acid extension at the N-terminus. The extension improves stability of ABDCon.

The ABDCon binding domain was designed by calculating a consensus amino acid sequence of certain 3-helix bundle albumin binding domain (ABD) sequences deposited in the non-redundant protein database using ABD from Streptococcus sp. G148 protein G (SEQ ID NO: 1) as a template, and selecting the most prevalent amino acid at each sequence position (Table 6). ABDCon has a high affinity to human albumin with a K_(D) of 75 pM and K_(off) of 3.02×10⁻⁵ l/s when tested with conditions specified herein, and therefore bioactive agents operably linked to ABDCon may be largely bound to albumin once administered to an animal patient. In a human patient, molecules binding serum albumin too weakly will have short serum half-life due to renal filtering (Hopp et al., Protein Eng Des Sel 23:827-834, 2010), whereas molecules binding serum albumin too tightly will not be released from albumin at the preferred site of action, and thus in some cases may have reduced ability to modulate activity of the desired target and provide a therapeutic benefit. It is therefore one aspect of the invention to have and be able to generate ABDCon variants and binding domains having a spectrum of affinities to albumin and hence provide the ability to modulate the half life of the bioactive agent operably linked to ABDCon variants and binding domains.

One embodiment of the invention is an isolated non-natural albumin binding domain comprising an amino acid sequence at least 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99% or 100% identical to SEQ ID NO: 21 (ABDCon): LKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKA.

Another embodiment of the invention is an isolated albumin binding domain comprising an amino acid sequence of SEQ ID NO: 21 having substitutions at 1, 2, 3, 4, 5, or 6, residues.

ABDCon variants can be designed by examining the crystal structure of an exemplary 3-helix bundle albumin binding protein in complex with albumin and making an assumption that ABDCon may bind albumin in a manner similar to the exemplary protein. An exemplary crystal structure that can be utilized is that of a GA module (protein G-related albumin-binding module) of protein PAB of an anaerobic bacterium Finegoldia magna (formerly Peptostreptococcus magnus) in complex with human albumin (Protein Data Base (PDB) code 1TF0 (Leion et al., J Biol Chem 279:42924-42928, 2004).

ABDCon variants having decreased affinity for albumin can be designed by various strategies, such as by disrupting predicted hydrophobic contacts, disrupting predicted pi-stacking between aromatic residues, introducing steric clashes by substitution with larger amino acids, disrupting salt bridges by removal of charged residues, and disrupting hydrogen bonding predicted to occur between ABDCon and albumin. Introduced changes are designed to decrease binding affinity without changing the binding surface in a way that would abolish binding. For example, residue Y21 can be substituted for charged amino acids (Lys, Arg, Asp, Glu) or smaller amino acids (Ala, Gly) to reduce hydrophobic interactions between this residue and albumin residues V325 and F326. In addition, Y21 of ABDCon forms a hydrogen bond with the backbone of albumin residues N318 and D324 such that minor changes such as mutation to Phe may slightly weaken interactions. Residue Y22 of ABDCon is predicted to form hydrophobic as well as pi-stacking interactions with albumin residues F309 and F326. Therefore substitution of Y22 for smaller neutral amino acids Ala, Ser, Val or charged amino acids Lys, Arg, Asp, or Glu may decrease hydrophobic contacts and reduced affinity of the ABDCon variant to albumin. Residue K30 in ABDCon is predicted to form a salt-bridge with albumin residue E227 and thus K30 can be substituted for Asp or Glu to introduce repulsive charges and potentially reduce ABDCon affinity to albumin. Mutation to any non-charged amino acid may also reduce affinity by eliminating the salt-bridge. ABDCon residue T31 is predicted to form an intermolecular hydrogen bond with albumin residue N267 and substitutions for Ala or Gly can be used to disrupt the intermolecular hydrogen bond without introducing a large steric clash that might significantly destabilize the interaction. ABDCon residue A37 can be substituted for Val, Tyr, or other larger amino acid in order to introduce steric clashes. Residue E41 can be substituted for Gln or Asn to remove charge. Introduction of positively charged residues such as Lys or Arg can be expected to further reduce binding affinity. ABDCon residues L25, E33, G34, L38, I42, and A45 are predicted to form direct contact with albumin, and substitutions at these residues are likely to modulate ABDCon affinity to albumin Residue positions refer to ABDCon of SEQ ID NO: 21 and human albumin of SEQ ID NO: 36.

Alternatively, a random cocktail of amino acids can be used, utilizing for example NNK codons for substitutions at identified positions, and the resulting variants are measured for their binding to albumin using standard methods and methods described herein.

Exemplary ABDCon variants are variants having substitutions in at least one residue selected from Y21, Y22, L25, K30, T31, E33, G34, A37, L38, E41, I42 and A45 of SEQ ID NO: 21.

Exemplary ABDCon variants are variants having substitutions in at least one residue selected from Y21, Y22, K30, T31, A37 and E41 of SEQ ID NO: 21.

An exemplary ABDCon variant comprises an amino acid sequence LKEAKEKAIEELKKAGITSDX₁X₂FDLINKAX₃X₄VEGVNX₅LKDX₆ILKA (SEQ ID NO: 22; wherein X₁, X₂, X₃, X₄, X₅, and X₆, can be any amino acid.

In other embodiments, an exemplary ABDCon variant comprises and amino acid sequence LKEAKEKAIEELKKAGITSDX₁X₂FDLINKAX₃X₄VEGVNX₅LKDX₆ILKA (SEQ ID NO: 23), wherein

-   -   i) X₁ is Lysine (K), Arginine (R), Aspartate (D), Glutamate (E),         Alanine (A), Glycine (G), Phenylalanine (F) or Tyrosine (Y);     -   ii) X₂ is Alanine (A), Serine (S), Valine (V), Lysine (K),         Arginine (R), Aspartate (D), Glutamate (E) or Tyrosine (Y);     -   iii) X₃ is Aspartate (D), Glutamate (E) or Lysine (K);     -   iv) X₄ is Alanine (A), Glycine (G) or Threonine (T);     -   v) X₅ is Valine (V), Tyrosine (Y) or Alanine (A); and     -   vi) X₆ is Glutamine (Q), Asparagine (N), Lysine (K),         Arginine (R) or Glutamate (E).

In other embodiments, an exemplary ABDCon variant comprises and amino acid sequence LKEAKEKAIEELKKAGITSDX₁X₂FDLINKAX₃X₄VEGVNX₅LKDX₆ILKA (SEQ ID NO: 24), wherein

-   -   i) X₁ is Lysine (K), Alanine (A) or Tyrosine (Y);     -   ii) X₂ is Alanine (A), Serine (S), Valine (V) or Tyrosine (Y);     -   iii) X₃ is Aspartate (D) or Lysine (K);     -   iv) X₄ is Alanine (A) or Threonine (T);     -   v) X₅ is Valine (V), Tyrosine (Y) or Alanine (A); and     -   vi) X₆ is Glutamine (Q) or Glutamate (E).

Additional exemplary ABDCon variants comprise amino acid sequences shown in SEQ ID NOs: 25-34. ABDCon variants are tested for albumin binding using well known methods, for example in an in vitro assay using plasmon resonance (BIAcore, GE-Healthcare Uppsala, Sweden). The measured affinity of a particular ABDCon variant/albumin interaction can vary if measured under different conditions (e.g., osmolarity, pH). Thus, measurements of affinity and other binding parameters (e.g., K_(D), K_(on), K_(off)) are preferably made with standardized solutions of ABDCon variant and albumin, and a standardized buffer, such as the buffer described herein. Affinity of ABDCon variants to albumin may range from at least about 1×10⁻⁵ M, at least about 1×10⁻⁶ M, at least about 1×10⁻⁷ M, at least about 1×10⁻⁸ M, at least about 1×10⁻⁹ M, at least about 1×10⁻¹⁰ M, m at least about 1×10⁻¹¹ M, at least about 1×10⁻¹² M, or at least about 1×10⁻¹³ M. For example, various substitutions at Y22 of ABDCon (SEQ ID NO: 21) reduced affinity of the variants to albumin about 300-1,000 fold depending on a substitution (Table 4). Additional variants having substitutions at defined positions or at combinations of positions can be designed and generated by those skilled in the art and tested for a desired albumin binding affinity using routine methods.

ABDCon and variants thereof can further be modified by addition of a 5 amino acid extension to the N-terminus of ABDCon or ABDCon variant. The 5 amino acid extension may consist of an amino acid sequence TIDEWL (SEQ ID NO: 43), or any amino acid sequence shown in SEQ ID NOs: 42 or 45-55. Incorporating the N-terminal 5 amino acid extension into the ABDCon and variants thereof can increase the stability of the molecule. The N-terminal 5 amino acid extension may be structurally ordered as part of the first alpha helix of ABDCon and variants. The improved stability of the N-terminally extended molecules may therefore result from stabilizing the overall structure of the helix. The N-terminal ABDCon variants can be made using standard methods and their stability, for example thermal stability, assessed as described herein. Any albumin binding domain (ABD) may be modified with the addition of the 5 N-terminal amino acids to stabilize the ABD structure and improve stability, such as thermal stability of the resulting molecule.

ABDCon and variants thereof can be further modified at residues not affecting binding to albumin for the purpose of for example improving stability, reducing immunogenicity, improving solubility, or any other suitable characteristics. In one way to achieve this goal, the ABDCon and variants thereof can be optionally prepared by a process of analysis of the parental sequences and various conceptual engineered products using three-dimensional models of the parental and engineered sequences. Three-dimensional models are commonly available and are familiar to those skilled in the art. Computer programs are available which illustrate and display probable three-dimensional conformational structures of selected candidate sequences and can measure possible immunogenicity (e.g., Immunofilter program of Xencor, Inc. of Monrovia, Calif.). Inspection of these displays permits analysis of the likely role of the residues in the functioning of the candidate sequence, for example, residues that influence stability of the ABDCon domain. In this way, residues can be selected and combined from the parent and reference sequences so that the desired characteristics, such as improved stability is achieved. Alternatively, or in addition to the above procedures, other suitable methods of engineering can be used as known in the art.

Desirable physical properties of albumin binding domains of the invention include high thermal stability and reversibility of thermal folding and unfolding. Several methods have been applied to increase the apparent thermal stability of proteins and enzymes, including rational design based on comparison to highly similar thermostable sequences, design of stabilizing disulfide bridges, mutations to increase alpha-helix propensity, engineering of salt bridges, alteration of the surface charge of the protein, directed evolution, and composition of consensus sequences (Lehmann and Wyss, Curr Opin Biotechnol 12:371-375, 2001). High thermal stability may increase the yield of the expressed protein, improve solubility or activity, decrease immunogenicity, and minimize the need of a cold chain in manufacturing.

Residues that can be substituted to improve any characteristics of the albumin binding domains the invention can be determined by making the substitution and assaying for the desired characteristics of the albumin binding domain. For example, alanine scanning may be employed to identify positions in ABDCon and variants thereof that may affect the stability of the albumin binding domain.

In terms of loss of stability, i.e., “denaturing” or “denaturation” of a protein, is meant the process where some or all of the three-dimensional conformation imparting the functional properties of the protein has been lost with an attendant loss of activity and/or solubility. Forces disrupted during denaturation include intramolecular bonds, for example, electrostatic, hydrophobic, Van der Waals forces, hydrogen bonds, and disulfides. Protein denaturation can be caused by forces applied to the protein or a solution comprising the protein, such as mechanical force (for example, compressive or shear-force), thermal, osmotic stress, change in pH, electrical or magnetic fields, ionizing radiation, ultraviolet radiation and dehydration, and by chemical denaturants.

Measurement of protein stability and protein lability can be viewed as the same or different aspects of protein integrity. Proteins are sensitive or “labile” to denaturation caused by heat, by ultraviolet or ionizing radiation, changes in the ambient osmolarity and pH if in liquid solution, mechanical shear force imposed by small pore-size filtration, ultraviolet radiation, ionizing radiation, such as by gamma irradiation, chemical or heat dehydration, or any other action or force that may cause protein structure disruption. The stability of the molecule can be determined using standard methods. For example, the stability of a molecule can be determined by measuring the thermal melting (T_(m)) temperature, the temperature in ° Celsius (° C.) at which half of the molecules become unfolded, using standard methods. Typically, the higher the T_(m), the more stable the molecule. In addition to heat, the chemical environment also changes the ability of the protein to maintain a particular three dimensional structure.

Chemical denaturation can likewise be measured by a variety of methods. Chemical denaturants include guanidinium hydrochloride, guanadinium thiocyanate, urea, acetone, organic solvents (DMF, benzene, acetonitrile), salts (ammonium sulfate lithium bromide, lithium chloride, sodium bromide, calcium chloride, sodium chloride); reducing agents (e.g. dithiothreitol, beta-mercaptoethanol, dinitrothiobenzene, and hydrides, such as sodium borohydride), non-ionic and ionic detergents, acids (e.g. hydrochloric acid (HCl), acetic acid (CH₃COOH), halogenated acetic acids), hydrophobic molecules (e.g. phosopholipids), and targeted denaturants. Quantitation of the extent of denaturation can rely on loss of a functional property, such as ability to bind a target molecule, or by physiochemical properties, such as tendency to aggregation, exposure of formerly solvent inaccessible residues, or disruption or formation of disulfide bonds.

The ABDCon binding domain and variants thereof may be operably linked to a bioactive agent. Exemplary bioactive agents are peptides and proteins that may be operably linked to ABDCon and variants thereof using well known linkers, for example a linker containing poly-glycine, glycine and serine (Gly-Ser linker), or alanine and proline. The use of naturally occurring as well as artificial peptide linkers is well known in the literature (Hallewell et al., J Biol Chem 264:5260-5268, 1989; Alfthan et al., Protein Eng. 8:725-731, 1995; Robinson & Sauer, Biochemistry 35:109-116, 1996; U.S. Pat. No. 5,856,456). The bioactive agent may be linked to the ABDCon or variant thereof from its C- or N-terminus Multi-specific bioactive agents may also be linked to ABDCon. In these cases, ABDCon may be linked to the N-terminus or C-terminus of the molecule. ABDCon may also be positioned internally in such a multispecific agent such that it is linked to the C-terminus of one agent and the N-terminus of another. Bioactive agents may also be coupled to the albumin binding domains of the invention using chemical crosslinking well known in the art, for example using hydrazone or semicarbazone linkage. Exemplary bioactive agents are proteins specifically binding a target antigen such as proteins identified from fibronectin type III (FN3) repeat protein libraries, such as Tencon25-based libraries described in WO2011/137319A2 and WO2010/093627A2.

Additional moieties may be incorporated into ABDCon or variants thereof of the invention, such as toxin conjugates, polyethylene glycol (PEG) molecules, such as PEG5000 or PEG20,000, fatty acids and fatty acid esters of different chain lengths, for example laurate, myristate, stearate, arachidate, behenate, oleate, arachidonate, octanedioic acid, tetradecanedioic acid, octadecanedioic acid, docosanedioic acid, and the like, polylysine, octane, carbohydrates (dextran, cellulose, oligo- or polysaccharides) for desired properties. These moieties may be direct fusions with the ABDCon coding sequences and may be generated by standard cloning and expression techniques. Alternatively, well known chemical coupling methods may be used to attach the moieties to recombinantly produced ABDCon of the invention.

ABDCon and variants thereof, as well as fusion proteins of bioactive agents and ABDCon can be assessed for their half life using well known pharmacokinetic properties in in vivo models. Exemplary ABDCon and variants thereof bind albumin with K_(D) of about between 1 pM-1 μM, between 75 pM-860 nM, between 100 pM-500 nM, or between 1 nM-100 nM.

Generation and Production of ABDCon and Variants Thereof

Generation of the albumin binding domains of the invention is typically achieved at the nucleic acid level using standard methods. ABDCon variants having substituted codons at one or more specific residues can be synthesized for example using standard PCR cloning methods, or chemical gene synthesis according to methods described in U.S. Pat. No. 6,521,427 and U.S. Pat. No. 6,670,127, or using Kunkel mutagenesis (Kunkel et al., Methods Enzymol 154:367-382, 1987). If randomized codons are to be used for any residue positions, randomization can be accomplished using well known methods, for example degenerate oligonucleotides matching the designed diversity, or for example using NNK codons, which encode all 20 naturally occurring amino acids. In other diversification schemes, DVK codons can be used to encode amino acids Ala, Trp, Tyr, Lys, Thr, Asn, Lys, Ser, Arg, Asp, Glu, Gly, and Cys. Alternatively, NNS codons can be used to give rise to all 20 amino acid residues and simultaneously reducing the frequency of stop codons. The codon designations are according to the well known IUB code.

Synthesis of oligonucleotides with selected nucleotide “degeneracy” at certain positions is well known in that art, for example the TRIM approach (Knappek et al., J Mol Biol 296:57-86, 1999; Garrard & Henner, Gene 128:103-109, 1993). Such sets of nucleotides having certain codon sets can be synthesized using commercially available nucleotide or nucleoside reagents and apparatus.

Standard cloning and expression techniques are used to clone ABDCon or variants thereof into a vector or synthesize double stranded cDNA of ABDCon to express, or to translate the protein in vitro. Bioactive agents can be operably linked to ABDCon or variants thereof using well known methods.

Nucleic Acid Molecules and Vectors

The invention provides for nucleic acids encoding ABDCon or variants thereof of the invention as isolated polynucleotides or as portions of expression vectors or as portions of linear DNA sequences, including linear DNA sequences used for in vitro transcription/translation, vectors compatible with prokaryotic, eukaryotic or filamentous phage expression, secretion and/or display of the compositions. Certain exemplary polynucleotides are disclosed herein, however, other polynucleotides which, given the degeneracy of the genetic code or codon preferences in a given expression system, encode ABDCon or variants thereof of the invention are also within the scope of the invention.

The polynucleotides of the invention may be produced by chemical synthesis, such as solid phase polynucleotide synthesis on an automated polynucleotide synthesizer and assembled into complete single or double stranded molecules. Alternatively, the polynucleotides of the invention may be produced by other techniques, such as a PCR followed by routine cloning. Techniques for producing or obtaining polynucleotides of a given known sequence are well known in the art.

The polynucleotides of the invention may comprise at least one non-coding sequence, such as a promoter or enhancer sequence, intron, polyadenylation signal, and the like. The polynucleotide sequences may also comprise additional sequences encoding additional amino acids that encode for example a marker or a tag sequence such as a hexa-histidine or an HA tag to facilitate purification or detection of the protein, a signal sequence, a fusion protein partner, such as cDNA encoding a bioactive agent, and the like.

An exemplary polynucleotide comprises sequences encoding ABDCon, sequences for a ribosome binding site, promoter sequence, terminator sequence, antibiotic resistance gene, and a bacterial origin of replication (ori). Exemplary polynucleotides encoding albumin binding domains of the invention are shown in SEQ ID NO: 35.

Another embodiment of the invention is a vector comprising at least one polynucleotide of the invention. Such vectors may be plasmid vectors, viral vectors, vectors for baculovirus expression, transposon based vectors or any other vector suitable for introduction of the polynucleotides of the invention into a given organism or genetic background by any means. Such vectors may be expression vectors comprising nucleic acid sequence elements that can control, regulate, cause or permit expression of a polypeptide encoded by such a vector. Such elements may comprise transcriptional enhancer binding sites, RNA polymerase initiation sites, ribosome binding sites, and other sites that facilitate the expression of encoded polypeptides in a given expression system. Such expression systems may be cell-based, or cell-free systems well known in the art.

Host Cell Selection or Host Cell Engineering

ABDCon and variants thereof of the present invention can be optionally produced by a cell line, a mixed cell line, an immortalized cell or clonal population of immortalized cells, as well known in the art. See, e.g., Ausubel, et al., ed., Current Protocols in Molecular Biology, John Wiley & Sons, Inc., NY, N.Y. (1987-2001); Sambrook, et al., Molecular Cloning: A Laboratory Manual, 2^(nd) Edition, Cold Spring Harbor, N.Y. (1989); Harlow and Lane, Antibodies, a Laboratory Manual, Cold Spring Harbor, N.Y. (1989); Colligan, et al., eds., Current Protocols in Immunology, John Wiley & Sons, Inc., NY (1994-2001); Colligan et al., Current Protocols in Protein Science, John Wiley & Sons, NY, N.Y., (1997-2001).

The host cell chosen for expression may be of mammalian origin or may be selected from COS-1, COS-7, HEK293, BHK21, CHO, BSC-1, Hep G2, 653, SP2/0, HeLa, myeloma, lymphoma, yeast, insect or plant cells, or any derivative, immortalized or transformed cell thereof. Alternatively, the host cell may be selected from a species or organism incapable of glycosylating polypeptides, e.g. a prokaryotic cell or organism, such as BL21, BL21(DE3), BL21-GOLD(DE3), XL1-Blue, JM109, HMS174, HMS174(DE3), and any of the natural or engineered E. coli spp, Klebsiella spp., or Pseudomonas spp strains.

Uses of Albumin Binding Domains of the Invention

The compositions of the non-natural albumin binding domain ABDCon and variants thereof of the invention can be used to modulate the half life and/or biodistribution of a bioactive agent within the tissue of an animal by operably linking ABDCon and the bioactive agent, and wherein the administration of the composition to an animal results in a half life and/or biodistribution of the bioactive agent which is different from the tissue distribution obtained upon administration of the active agent alone.

Pharmaceutical Compositions Comprising ABDCon or Variants Thereof

The ABDCon or variants thereof binding albumin operably linked to bioactive agents can be isolated using separation procedures well known in the art for capture, immobilization, partitioning, or sedimentation, and purified to the extent necessary for commercial applicability.

For therapeutic use, the bioactive molecule-ABDCon fusion proteins may be prepared as pharmaceutical compositions containing an effective amount of the bioactive agent-ABDCon fusion protein as an active ingredient in a pharmaceutically acceptable carrier. The term “carrier” refers to a diluent, adjuvant, excipient, or vehicle with which the active compound is administered. Such vehicles can be liquids, such as water and oils, including those of petroleum, animal, vegetable or synthetic origin, such as peanut oil, soybean oil, mineral oil, sesame oil and the like. For example, 0.4% saline and 0.3% glycine can be used. These solutions are sterile and generally free of particulate matter. They may be sterilized by conventional, well-known sterilization techniques (e.g., filtration). The compositions may contain pharmaceutically acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering agents, stabilizing, thickening, lubricating and coloring agents, etc. The concentration of the bioactive agent-ABDCon fusion protein in such pharmaceutical formulation can vary widely, i.e., from less than about 0.5%, usually at or at least about 1% to as much as 15 or 20% by weight and will be selected primarily based on required dose, fluid volumes, viscosities, etc., according to the particular mode of administration selected. Suitable vehicles and formulations are described, for example, in e.g. Remington: The Science and Practice of Pharmacy, 21^(st) Edition, Troy, D. B. ed., Lipincott Williams and Wilkins, Philadelphia, Pa. 2006, Part 5, Pharmaceutical Manufacturing pp 691-1092, See especially pp. 958-989.

The mode of administration for therapeutic use of the bioactive agent-ABDCon fusion protein may be any suitable route that delivers the agent to the host, such as parenteral administration, e.g., intradermal, intramuscular, intraperitoneal, intravenous or subcutaneous, pulmonary; transmucosal (oral, intranasal, intravaginal, rectal); using a formulation in a tablet, capsule, solution, suspension, powder, gel, particle; and contained in a syringe, an implanted device, osmotic pump, cartridge, micropump; or other means appreciated by the skilled artisan, as well known in the art. Site specific administration may be achieved by for example intrarticular, intrabronchial, intraabdominal, intracapsular, intracartilaginous, intracavitary, intracelial, intracerebellar, intracerebroventricular, intracolic, intracervical, intragastric, intrahepatic, intracardial, intraosteal, intrapelvic, intrapericardiac, intraperitoneal, intrapleural, intraprostatic, intrapulmonary, intrarectal, intrarenal, intraretinal, intraspinal, intrasynovial, intrathoracic, intrauterine, intravascular, intravesical, intralesional, vaginal, rectal, buccal, sublingual, intranasal, or transdermal delivery.

While having described the invention in general terms, the embodiments of the invention will be further disclosed in the following examples that should not be construed as limiting the scope of the claims.

Example 1 Generation of Non-Natural Albumin Binding Domain Design of Non-Natural Albumin Binding Domain Consensus (ABDCon)

A non-natural albumin binding domain (ABD) was designed by calculating a consensus amino acid sequence of 3-helix bundle ABD sequences deposited in the non-redundant protein database. In order to determine the consensus sequence, the ABD from Streptococcus sp. G148 protein G (SEQ ID NO: 1) was used as a template sequence for a BLAST search against the non-redundant NCBI protein database (http://blast_ncbi_nlm_nih_gov/Blast_cgi). All default settings were used for the BLAST search; Expect threshold=10, word size=3, matrix=BLOSUM62, gap costs=existence 11, extension 1, and compositional adjustments=conditional compositional score matrix adjustment. From this search, the 20 most closely related protein domains, listed in Table 1 (SEQ ID NOs: 1-20), were selected to be included in a multiple sequence alignment in order to determine a consensus. Only non-redundant sequences were selected. Several protein accession numbers are listed multiple times in Table 1, indicative that some proteins contain several closely related ABD domains. SEQ ID NO:4 is a non-natural ABD derived by phage display and gene shuffling. (He et al., Protein Sci 16::1490-1494, 2007).

TABLE 1 Protein Domain SEQ ID Accession Number NO: P19909 1 AAA26847 2 AAA26847 3 2FS1_A 4 YP_002123072 5 ZP_07321229 6 ZP_07321229 7 AAA67503 8 AAA67503 9 AAA67503 10 AAA67503 11 ZP_07734934 12 ZP_06946534 13 ZP_07321240 14 ZP_07906833 15 Q51911 16 YP_001692809 17 ZP_07702676 18 ZP_07702676 19 ZP_07268895 20

A multiple sequence alignment was generated from the sequences listed using AlignX software (using all default settings). The sequence alignment shown in Table 6 was used to select the most prevalent amino acid at each sequence position to derive the albumin binding domain consensus sequence, ABDCon (SEQ ID NO: 21, Table 6). Tyr was chosen instead of Ile for position 21 as there was no clear consensus for this position and aromatic residues Tyr and Phe were well represented. Pairwise sequence identities between ABDCon range from 45% (SEQ ID NO: 3) to 82% (SEQ ID NOs: 8, 13, 14, and 16).

Gene Synthesis

The amino acid sequence of the albumin binding domain consensus (ABDCon) was back translated into a nucleic acid sequence encoding for ABDCon using preferred codons for E. coli expression as below (SEQ ID NO: 35) and a synthetic gene produced (BlueHeron Biotechnologies). 5′ and 3′ DNA sequences were added to the synthetic gene sequence of SEQ ID NO: 34 in order to add NdeI and XhoI sites for subcloning, as well as DNA sequences encoding for an N-terminal 8-His tag for protein purification. This gene was cloned into a pET26 vector (Novagen) for expression driven by a T7 promoter sequence and transformed into E. coli strain BL21(DE3) (Novagen).

Expression and Purification

For expression of ABDcon, 50 mL of LB media supplemented with 30 μg/mL kanamycin was inoculated with 1 colony and grown overnight at 37° C., 220 rpm shaking. The next day, 10 mL of the overnight culture was added to 100 mL of Terrific Broth supplemented with 30 μg/mL kanamycin and the culture grown at 37° C., 220 rpm for 2.5 hours. IPTG was added to a final concentration of 1 mM and the temperature reduced to 30° C. to induce protein expression. Cells were harvested 14 hours later by centrifugation at 4000×g for 20 minutes and the cell pellets stored at −20° C. Frozen cell pellets were resuspended in 5 mL of BugBuster HT (Novagen) per gram of wet pellet and gently mixed at room temperature for 30 minutes. The poly-histidine tagged ABDCon molecule was purified by Ni-NTA chromatography (GE Healthcare), eluting in a buffer of 50 mM sodium phosphate pH 7.4, 500 mM sodium chloride with a gradient of 10-250 mM imidazole. Fractions containing ABDCon were pooled and further purified by size exclusion chromatography using a Superdex75 16/60 column (GE Healthcare) with a mobile phase of PBS. Purity was assessed by SDS-PAGE analysis (FIG. 1). Mass spectrometry determined the mass to be 6383 Da, in agreement with the theoretical mass of 6382 Da (FIG. 2). Analytical size exclusion chromatography using a Superdex 75 5/150 column (GE Healthcare) shows that the ABDCon preparation is free of aggregates and elutes at a time consistent with a monomeric protein (FIG. 3).

Example 2 Characterization of ABDCon Thermal Stability of ABDCon

ABDCon was concentrated to 2.175 mg/mL in PBS pH 7.4 and the thermal stability assessed by differential scanning calorimetry (DSC). Stability data was generating by heating 400 μL of the ABDCon solution from 25° C. to 100° C. at a scan rate of 1° C. per minute in a VP-DSC instrument (MicroCal). A second identical scan was completed on the sample in order to assess the reversibility of thermal folding/unfolding. Data was fit to a 2-state unfolding model in order to calculate the melting temperature. FIG. 4A and FIG. 4B shows that ABDCon has a high thermal stability of 81.5° C. in PBS and that folding is fully reversible.

ABDCon Binding to Albumin

The kinetics of ABDCon binding to human serum albumin and mouse serum albumin were measured on a ProteOn™ XPR-36 Protein Interaction Array System (Bio-Rad) using GLC sensor chips. Human (SEQ ID NO: 36), Rhesus (SEQ ID NO: 37), and murine (SEQ ID NO: 38) serum albumins were purchased from Sigma (Catalogue # A4327 for human, #A3559 for murine, and #A4297 for rhesus) and resuspended in PBS at different concentrations. Each serum albumin was directly immobilized on a ligand channel in the vertical orientation of a GLC chip via standard amine coupling at 2.1 μg/mL at pH 5.0 to obtain surfaces with ligand densities of 500-1000 resonance units. Binding of recombinant ABDCon was tested by flowing five different concentrations (e.g. 1 μM diluted in a 3-fold concentration series) as analytes simultaneously in the horizontal orientation over the immobilized serum albumin surfaces. The dissociation phases for all concentrations were monitored for two hours at a flow rate of 100 μL/min using PBST (PBS, 0.005% Tween20) as running buffer. A sixth sample (buffer only) was injected to monitor the baseline stability. The surfaces were regenerated using 1 short pulse (18 μL) of 0.8% phosphoric acid.

TABLE 2 Albumin k_(on) (1/Ms) k_(off) (1/s) K_(D)(M) Human 4.04E+05 3.02E−05 7.48E−11 Mouse 2.41E+05 7.76E−04 3.22E−09 Rhesus 1.13E+06 6.78E−05 6.01E−11 The raw response data were first processed by subtracting the buffer only responses and the non-specific binding between the analytes and the chip. Processed data of all five concentrations were globally fit to a 1:1 simple langmuir binding model for each ligand surface. Table 2 describes the binding kinetics determined for each species of albumin. Serum Half-Life of ABDCon Fusion in Mice

The ability of ABDCon to extend the serum half-life of a fusion protein was evaluated by producing a synthetic gene encoding a fusion of ABDCon to the c-terminus of Tencon25. Tencon25 is a protein scaffold based on a consensus sequence of a fibronectin type III (FN3) repeat protein having a sequence shown in residues 1-90 of SEQ ID NO: 39, and described in US2011/0274623A1. Tencon25 and ABDCon protein domains were fused by a (G₄S)₂ peptide linker (SEQ ID NO: 40). The resulting fusion protein has a polypeptide sequence shown in SEQ ID NO: 39. A poly-histidine tag was incorporated at the C-terminus for purification purposes. Sixty nine BALB/c female mice were split into 3 groups (N=3 group 1 non-treated control, and N=33 groups 2-3). Mice were treated with a single intravenous dose of the Tencon25-ABDCon fusion protein at 2 mg/kg. The dosing was based upon the weight of the animals on the day of administration. The mice were euthanized at the following time points after the injection: 10 min, 30 min, 1, 4, 6, hours, and 1, 2, 3, 7, 10, 14 days. Blood samples were taken from each animal via cardiac puncture. The blood samples were allowed to clot at room temperature for 30 minutes, but no longer than 1 hour. The blood samples were then centrifuged at approximately 3,500 rpm for 15 minutes. Serum samples were analyzed using a homogenous sandwich ELISA on the Mesoscale Discovery platform. Streptavidin-Gold plates (Mesoscale Discovery) were blocked for 1 hour with Superblock (TBS) Tween-20 (Thermo). Polyclonal anti-Tencon25 antibody was used for both capture (biotinylated) and detection (labeled with MSD Sulfo-Tag (Mesoscale Discovery)) at 0.625 m/ml. The antigen and antibodies were added to the plates, which were incubated for 2 hours with vigorous shaking at RT. Plates were washed with TBS-T (Sigma) and MSD Read Buffer with Surfactant (Mesoscale Discovery) was added. The plates were read using the MSD Sector Imager 6000. Data was analyzed using GraphPad Prism. Previous studies have shown that a similar, unfused Tencon molecule is cleared from the bloodstream quickly with a serum half-life of approximately 20 minutes in mice. Fusion of Tencon25 to ABDCon extends the serum half-life to over 60 hours (FIG. 5).

Example 3 Engineering of ABDCon for Varying Affinity to Serum Albumin

The affinity of binding to serum albumin can dictate not only the serum half-life of a therapeutic protein but also the ability of that molecule to bind and neutralize its target. For example, a molecule that binds serum albumin too weakly will have short serum half-life due to renal filtering while not bound to albumin (Hopp et al., Protein Eng Des Sel 23:827-834, 2010). On the contrary, a molecule that binds to albumin too tightly will not be released from albumin at the preferred site of action and thus may be unable to neutralize the desired target in some cases. It is therefore preferable to achieve half-life extension via albumin binding in a way in which the albumin interaction is only tight enough to give the desired serum half-life. As the ABDCon sequence described herein binds to human serum albumin with an affinity of 75 pM and an off-rate of 3.02×10¹⁵ 1/s (under experimental conditions described herein), molecules fused to ABDCon will be largely bound to albumin once administered to an animal or patient. For some targets and fusions, it may be desirable to be bound less tightly to serum albumin. Ten mutant versions of ABDCon were designed to lower the binding affinity of ABDCon for albumin. Table 3 summarizes these mutants:

TABLE 3 SEQ ID Construct Mutation* NO: Rationale ABDCon2 Y21A 25 Disrupt aromatic stacking with albumin residues F309 and F326 ABDCon3 Y21K 26 Disrupt aromatic stacking with albumin residues F309 and F326. Insert steric clash ABDCon4 Y22A 27 Decrease hydrophobic contacts ABDCon5 Y22S 28 Decrease hydrophobic contacts ABDCon6 Y22V 29 Decrease hydrophobic contacts slightly ABDCon7 E41Q 30 Remove charge to disrupt salt-bridge ABDCon8 K30D 31 Alter charge to disrupt salt-bridge ABDCon9 T31A 32 Remove intermolecular hydrogen bond ABDCon10 A37V 33 Introduce steric clash ABDCon11 A37Y 34 Introduce steric clash *Amino acid numbering according to SEQ ID NO: 21

ABDCon mutants were selected by examining the crystal structure of the GA module (protein G-related albumin binding module) bound to human serum albumin (PDB code 1TF0) (Lejon et al., Acta Crystallogr Sect F Struct Biol Cryst Commun 64:64-69, 2008) and making the assumption that ABDCon binds to albumin in a manner very similar to GA. Mutants were designed to decrease the affinity of ABDCon for albumin by disrupting hydrophobic contacts, introducing steric clashing, disrupting salt bridges, and disrupting hydrogen bonding (Table 3). Changes introduced were designed to decrease binding affinity without changing the binding surface so dramatically that binding was abolished. Each mutant was expressed and purified from E. coli as described for ABDCon. Mutant ABDCon11 was found to be insoluble and thus excluded from further analysis. The affinities of each variant for human, mouse, and rhesus serum albumin were determined by surface plasma resonance and shown in Table 4. In addition to the positions listed in Table 3, residues L25, E33, G34, L38, I42, and A45 may be mutated to increase or decrease the affinity of ABDCon for albumin as they are predicted to form direct contacts with albumin.

TABLE 4 K_(D) Human K_(D) Mouse K_(D) Rhesus Variant (nM) (nM) (nM) ABDCon2 6.1 ND 17.5 ABDCon3 1 103 5.1 ABDCon4 13.6 571 43.7 ABDCon5 54.8 1650 104 ABDCon6 2.7 190 8.4 ABDCon7 0.4 20.6 0.5 ABDCon8 43 1001.7 65 ABDCon9 860 550.8 206 ABDCon10 0.8 37.8 1.3

Example 4 Serum Half-Life of ABDCon Variant Fusion Proteins

Tencon25 (amino acids 1-90 of SEQ ID NO: 39) was fused to ABDCon variants 3, 5, 7, and 9 (Table 3) in order to assess the correlation between ABDCon affinity for albumin with half-life extension. These molecules were dosed into mice at 2 mg/kg as described above in previous studies and analyzed using identical methods as described for the Tencon25-ABDCon fusions. A summary of the PK parameters obtained for these molecules is shown in Table 5 and FIG. 6. Here it is demonstrated that the rate of clearance decreases as affinity for albumin is increased until reaching an affinity of 103 nM at which point, no large differences in PK parameters are obtained. The data in Table 5 demonstrate the ability to tune properties such as half-life, rate of clearance, and total exposure (AUC) by varying the affinity of the ABDCon molecule for albumin.

TABLE 5 Volume of T (½) Distribution Clearance AUC Construct K_(D) (nM) (hours) (mL/kg) (mL/hr/kg) (hour * ng/ml) Tencon25-ABDCon 1.86 60.42 83.2873 0.9555 2054332 Tencon25-ABDCon3 103 45.8355 60.645 0.915 2166461 Tencon25-ABDCon5 1650 32.8715 223.86 4.72 423635 Tencon25-ABDCon7 20.6 46.7499 49.915 0.74 2681641 Tencon25-ABDCon9 550.8 34.1956 81.87 1.66 1205061

TABLE 6 SEQ ID Sequence 13 LKEAKEKAVEELKENGITSEKYIDQINKAKTVEGVNALKDEIIKA 14 LKEAKEKAVEELKNNGITSEKYIDQINKAKTVEGVNALKDEIIKA 17 LKEAKEKAVEELKNNGITSEKYIEQINKAKTVEGVNALKDEIIKA 20 LKEAKEKAIEELKNNGITSEKYIEQINKAKTVEGVNALKDEIIKS 7 LKDAKEKAIEAIRKEGVKSKLYEDLINKAKTIDGVNALRDQIIEA 1 LAEAKVLANRELDKYGVS-DYYKNLINNAKTVEGVKALIDEILAA 2 LSEAKEMAIRELDAQGVS-DFYKNKINNAKTVEGVVALKDLILNS 3 LDQAKQAALKEFDRYGVS-NYYKNLINKAKTVEGIMELQAQVV-- 12 LAEAKKVAHEEFTKAGITGKIFHDAIDAAKTVEGLKAYVAETLAA 15 LAEAKKVAHEEFTKAGITGKIFHDAIDAAKTVEGLQAYVAETLAA 18 LAEAKNVAHAEFTKAGITGKIFHDAIDAAKTVEGLQAYVAETLAA 19 LAEAKKAAHEEFTKAGITGKIFHDAIDAAKTVEGLQAYVAETLKA 4 LAQAKEAAIKELKQYGIG-DYYIKLINNAKTVEGVESLKNEILKA 5 LLKAKEAAINELKQYGIS-DYYVTLINKAKTVEGVNALKAEILSA 16 LKNAKEDAIAELKKAGITSDFYFNAINKAKTVEEVNALKNEILKA 10 LKNAKEDAIKELKEAGISSDIYFDAINKAKTVEGVEALKNEILKA 11 LKNAKEDAIKELKEAGITSDIYFDAINKAKTIEGVEALKNEILKA 6 LKNAKEDAIKELKEAGIKSQFFFNLINNAKTVEGVESLKNEILKA 9 LKNAKEAAIKELKEAGITAEYLFNLINKAKTVEGVESLKNEILKA 8 LKNAKEEAIKELKEAGITSDLYFSLINKAKTVEGVEALKNEILKA 21 LKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKA

Example 5 Stabilization of Albumin Binding Domains

Studies were completed to determine the stability of ABDCon (SEQ ID NO: 21) when produced as fusion proteins with various fibronectin type III (FN3) domains (see, for example, U.S. Pat. Publ. No. US2010/0216708). The FN3 domain-ABDCon fusion proteins were generated using standard cloning techniques. The amino acid sequence of one of the fusion proteins, Tencon-ABDCon is shown in SEQ ID NO: 41. Other FN3 domain-ABDCon fusion proteins made were Tencon25-ABDCon, 83-ABDCon and 71-ABDCon. These proteins were produced with c-terminal poly histidine tags and purified by a combination of nickel affinity and size exclusion chromatography using standard methods. Each purified molecule was incubated in PBS pH 7.4 at 37° C. for 28 days before analysis by SDS-PAGE and Mass Spectrometry. FIG. 7A demonstrates that each FN3 domain-ABDCon fusion protein was found to be degraded during this incubation as evidenced by the appearance of low molecular weight bands on the SDS-PAGE gel. Mass Spectrometry analysis confirmed that the main degradation pattern was clipping of these molecules at residues L1, K2, and E3 of the ABDCon sequence (SEQ ID NO: 21). In addition, it was observed that the native Streptococcus protein G ABD (SEQ ID NO: 1) fused to a FN3 domain displayed a similar degradation pattern with clipping at residue L1 when incubated at 4° C. for 6-8 months (Data not shown). Finally, several purified lots of native ABD (SEQ ID NO:1) and ABDCon (SEQ ID NO: 21) were observed to be inactive and undetectable in solution by SDS-PAGE once stored for several months at 4° C., indicative of severe degradation.

The above observations suggested that the N-terminal alpha helix of the ABDCon and native ABD structures as used for serum half-life extension are unstable. This lack of stability for such fusion proteins is undesirable as it potentially limits the shelf-life of such molecules for research as well as therapeutic applications. As such, a strategy was developed to improve the stability of these molecules. Analysis of the three dimensional structures of albumin binding domains deposited in the Protein Data Bank shows that the amino acid sequence TIDQWL (SEQ ID NO: 42) found N-terminal to the start of the native ABD (SEQ ID NO: 1) is structurally ordered as part of the first alpha helix of this molecule in several crystal structures (e.g. PDB 2VDB, Acta Cryst 2008 F64, 64-69). This is in contrast to the original NMR structure of the ABD which showed this region to be disordered in solution (PDB 1GAB, Johansson et al., J. Mol. Biol. 266: 859-865 1997). Thus, it was hypothesized that extending this first alpha helix of the ABD and ABDCon sequences could impart greater stability to this region as extending alpha helices can impart greater stability to such a helix (Su et al., Biochemistry 33:15501-15510, 1994).

A multiple sequence alignment of the natural albumin binding domains presented in Table 1 revealed no clear consensus sequence for these N-terminal residues. However one peptide sequence, TIDEWL (SEQ ID NO: 43), is present N-terminal to 5 of these protein domains. Thus, a new ABDCon construct, ABDCon12 (SEQ ID NO: 44), was generated by adding the TIDEWL sequence to the N-terminus of ABDCon. This protein was expressed with an N-terminal poly histidine tag and purified to homogeneity using standard methods for nickel affinity chromatography and size exclusion chromatography. Purified ABDCon12 was incubated at 37° C. in PBS for 28 days and stability assessed by SDS-PAGE and mass spectrometry. SDS-PAGE showed a slightly faster migration pattern after day 14 indicative of degradation. Total mass analysis however demonstrates that this degradation is occurring exclusively in the polyhistidine tag and not in the ABDCon12 sequence, indicating that the TIDEWL sequence improved the stability of ABDCon. Further proof of stability was demonstrated in the stability of a generated FN3 domain-ABDCon12 fusion protein (FIG. 7B) which showed significantly less degradation products compared to the original FN3 domain-ABDCon molecules (FIG. 7A) when incubated at 37° C. in PBS for 28 days.

The melting temperature of ABDCon12 was determined by differential scanning calorimetry using the procedures outlined in Example 2 above in order to investigate the mechanism of stabilization for this molecule. A melting temperature of 90.9° C. was obtained in PBS, a 9.4° increase compared to the original ABDcon molecule, suggesting that the decrease in proteolysis/degradation observed for ABDCon12 and ABDCon12 fusion proteins is a result of increased conformational stability afforded by the extension of the N-terminal alpha helix.

SEQ ID NO 41: Tencon-ABDCon MLPAPKNLVVSEVTEDSLRLSWTAPDAAFDSFLIQYQESEKVGEAINLT VPGSERSYDLTGLKPGTEYTVSIYGVKGGHRSNPLSAEFTTGGGGSGGG GSLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEILKAGG HHHHHH SEQ ID NO: 42: N-terminal sequence of Strep G. ABD TIDQWL SEQ ID NO: 43: N-terminal sequence appended to ABDCon TIDEWL SEQ ID NO: 44: ABDCon12 TIDEWLLKEAKEKAIEELKKAGITSDYYFDLINKAKTVEGVNALKDEIL KA

Example 6 Characterization of ABDCon12

The affinity of purified ABDCon12 binding to human and murine albumin was determined by surface plasmon resonance using the same methods as described above in Example 2. Dissociation constants of 0.7 nM and 8.2 nM were obtained for ABDCon12 binding to human and murine albumin, respectively. The ability of ABDCon12 to extend the serum half-life of a fusion molecule was demonstrated by fusing ABDCon12 to the C-terminus of an FN3 domain specifically binding an antigen. This molecule was administered to mice by IP injection at 2 mg/kg and analyzed as described above in Example 4. A terminal half-life of 55 hours was measured for the FN3 domain-ABDCon12 fusion protein.

Example 7 Stabilizing Albumin Binding Domains

Based on sequence analysis of naturally occurring albumin binding domains, it is anticipated that other sequences added to the N-terminus of albumin binding domains may make them more stable. For example, a number of different sequences are found N-terminal to these natural albumin binding domains such as but not limited to APAVDV (SEQ ID NO: 45), IAKEKA (SEQ ID NO: 46), TIDQWL (SEQ ID NO: 42), VPAADV (SEQ ID NO: 47), TVKSIE (SEQ ID NO: 48), TPAVDA (SEQ ID NO: 49), TLKSIK (SEQ ID NO: 50), WEKAAA (SEQ ID NO: 51), AVDANS (SEQ ID NO: 52), QLAAEA (SEQ ID NO: 53), ALKAAA (SEQ ID NO: 54), EKLAAA (SEQ ID NO: 55). Addition of these sequences to albumin binding domains might increase stability if these sequences produce longer alpha helices. In addition, non-natural peptides that increase alpha helix length or stability are predicted to stabilize albumin binding domains as well.

Variants with additional N-terminal sequences can be generated using standard techniques and their properties tested as described supra.

It will be clear that the invention can be practiced otherwise than as particularly described in the foregoing description and examples. Numerous modifications and variations of the present invention are possible in light of the above teachings and, therefore, are within the scope of the appended claims. 

What is claimed:
 1. An isolated non-natural albumin binding domain, comprising an amino acid sequence selected from the group consisting of SEQ ID NOs: 21-34.
 2. The isolated albumin binding domain of claim 1, further comprising an extension of 5 amino acids at its N-terminus.
 3. The isolated albumin binding domain of claim 2, wherein the extension of 5 amino acids comprises an amino acid sequence of SEQ ID NOs: 42, 43 or 45-55.
 4. The isolated albumin binding domain of claim 2, wherein the extension of 5 amino acids comprises an amino acid sequence of SEQ ID NO:
 43. 5. The isolated albumin binding domain of claim 2, comprising the amino acid sequence of SEQ ID NO:44.
 6. A fusion protein comprising an albumin binding domain of claim 1 and a bioactive agent selected from the group consisting of proteins, antibodies, peptides, nucleotides, and small molecule pharmaceuticals, wherein the bioactive agent specifically binds a target molecule.
 7. The fusion protein of claim 6, wherein the bioactive agent is a protein scaffold comprising a consensus sequence of a fibronectin type III (FN3) repeat protein.
 8. The fusion protein of claim 7, wherein the protein scaffold comprises amino acid residues 1-90 of SEQ ID NO:
 39. 9. The fusion protein of claim 7, wherein the albumin binding protein and the bioactive agent are operably linked by a linker.
 10. The fusion protein of claim 9, wherein the linker comprises a Gly-Ser linker.
 11. The fusion protein of claim 10, wherein the linker comprises an amino acid sequence of SEQ ID NO:
 40. 12. A pharmaceutical composition comprising the fusion protein of claim 6 and at least one pharmaceutically acceptable carrier or diluent. 